Reliable Data Processing Enabled By Program Analysis
نویسندگان
چکیده
Bao, Tao Ph.D., Purdue University, December 2014. Reliable Data Processing Enabled by Program Analysis. Major Professor: Xiangyu Zhang. Errors pose a serious threat to the output validity of modern data processing, which is often performed by computer programs. In scientific computation, data are collected through instruments or sensors that may be exposed to rough environmental conditions, leading to errors. Furthermore, during the computation process data may not be precisely represented due to the limited precision of the underlying machine, leading to representation errors. Computational processing of these data may hence produce unreliable output results or even faulty conclusions. We call them reliability problems. We consider the reliability problems that are caused by two kinds of errors. The first kind of errors includes input and parameter errors, which originate from the external physical environment. We call these external errors. The other kind of errors is due to the limited representation of floating point values. They occur when values cannot be precisely represented by machines. We call them internal representation errors, or internal errors. They are usually at a much smaller scale compared to external errors. Nonetheless, such tiny errors may still lead to unreliable results and serious problems. In this dissertation, we develop program analysis techniques to enable reliable data processing. For external errors, we propose techniques to improve the sampling efficiency of Monte Carlo methods, namely execution coalescing and white-box sampling. For internal errors, we develop efficient monitoring techniques to detect instability problems at runtime in floating point program executions.
منابع مشابه
Development of an integrated program of sensory rehabilitation based on vibroacoustic and virtual reality and its effectiveness on the profile of auditory processing in children with autism spectrum disorder: A Case study
Introduction: People with autism spectrum disorder have sensory abnormalities in addition to social interactions, communication skills, limited interests and stereotyped behaviors. Therefor the present study conducted with the aim of development of an integrated program of sensory rehabilitation based on vibroacoustic and virtual reality and its effectiveness on the profile of auditory, in chil...
متن کاملO-3: Drug Repositioning by Merging Gene Expression Data Analysis and Cheminformatics Target Prediction Approaches
The transcriptional responses of drug treatments combined with a protein target prediction algorithm was utilised to associate compounds to biological genomic space. This enabled us to predict efficacy of compounds in cMap and LINCS against 181 databases of diseases extracted from GEO. 18/30 of top drugs predicted for leukemia (e.g. Leflunomide and Etoposide) and breast cancer (e.g. Tamoxifen a...
متن کاملAsynchronous Communication for Finite-Difference Simulations on GPU Clusters using CUDA and MPI
Graphical processing Units (GPUs) are finding widespread use as accelerators in computer clusters. It is not yet trivial to program applications that use multiple GPU-enabled cluster nodes efficiently. A key aspect of this is managing effective communication between GPU memory on separate devices on separate nodes. We develop a algorithmic framework for Finite-Difference numerical simulations t...
متن کاملDecentralized spatial data mining for geosensor networks
Advances in distributed sensing and computing technology offer new, reliable, and costeffective means to collect fine-grained spatiotemporal data. Conventional spatiotemporal data mining procedures, however, are based on centralized models of information processing, where sophisticated and powerful central systems collate and process global information. By contrast, decentralized spatial comput...
متن کاملComputational and Data-Enabled Analysis for Sustainable Transportation Systems
16. Abstract Transportation planners and traffic engineers are faced nowadays with immense modeling challenges arising from several emerging policy, planning, and engineering developments. In fact, the recent emergence of mobile sensing and traffic monitoring technology has provided an unprecedented amount of information and data for traffic analysis, demanding the adaptation of mathematical an...
متن کامل